cd/entity/Charles Fryeยท homeโ€บ entitiesโ€บ Charles Frye
grep -l @charles frye /news/*.json | wc -l โ†’ 1

Charles Frye

mentions 1 type Person feed RSS
23:22
2026-06-11
modal.com
large-language-models

Making FlashAttention-4 faster for inference

Modal AI engineers Charles Frye and David Wang optimized FlashAttention-4 for large language model inference, focusing on decode-heavy workloads dominated by memory bandwidth-limited token generation.โ€ฆ

// co-occurs with top 3 entities